Search CORE

114 research outputs found

NETASA: neural network based prediction of solvent accessibility

Author: Ahmad Shandar
Gromiha M. Michael
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2002
Field of study

Motivation: Prediction of the tertiary structure of a protein from its amino acid sequence is one of the most important problems in molecular biology. The successful prediction of solvent accessibility will be very helpful to achieve this goal. In the present work, we have implemented a server, NETASA for predicting solvent accessibility of amino acids using our newly optimized neural network algorithm. Several new features in the neural network architecture and training method have been introduced, and the network learns faster to provide accuracy values, which are comparable or better than other methods of ASA prediction. Results: Prediction in two and three state classification systems with several thresholds are provided. Our prediction method achieved the accuracy level upto 90% for training and 88% for test data sets. Three state prediction results provide a maximum 65% accuracy for training and 63% for the test data. Applicability of neural networks for ASA prediction has been confirmed with a larger data set and wider range of state thresholds. Salient differences between a linear and exponential network for ASA prediction have been analysed

Universiti Putra Malaysia Institutional Repository

PINT: Protein–protein Interactions Thermodynamic Database

Author: Gromiha M. Michael
Kumar M. D. Shaji
Publication venue: Oxford University Press
Publication date: 28/12/2005
Field of study

The first release of Protein–protein Interactions Thermodynamic Database (PINT) contains >1500 data of several thermodynamic parameters along with sequence and structural information, experimental conditions and literature information. Each entry contains numerical data for the free energy change, dissociation constant, association constant, enthalpy change, heat capacity change and so on of the interacting proteins upon binding, which are important for understanding the mechanism of protein–protein interactions. PINT also includes the name and source of the proteins involved in binding, their Protein Information Resource, SWISS-PROT and Protein Data Bank (PDB) codes, secondary structure and solvent accessibility of residues at mutant positions, measuring methods, experimental conditions, such as buffers, ions and additives, and literature information. A WWW interface facilitates users to search data based on various conditions, feasibility to select the terms for output and different sorting options. Further, PINT is cross-linked with other related databases, PIR, SWISS-PROT, PDB and NCBI PUBMED literature database. The database is freely available a

Crossref

PubMed Central

TMBETA-NET: discrimination and prediction of membrane spanning β-strands in outer membrane proteins

Author: Ahmad Shandar
Gromiha M. Michael
Suwa Makiko
Publication venue: Oxford University Press
Publication date: 27/06/2005
Field of study

We have developed a web-server, TMBETA-NET for discriminating outer membrane proteins and predicting their membrane spanning β-strand segments. The amino acid compositions of globular and outer membrane proteins have been systematically analyzed and a statistical method has been proposed for discriminating outer membrane proteins. The prediction of membrane spanning segments is mainly based on feed forward neural network and refined with β-strand length. Our program takes the amino acid sequence as input and displays the type of the protein along with membrane-spanning β-strand segments as a stretch of highlighted amino acid residues. Further, the probability of residues to be in transmembrane β-strand has been provided with a coloring scheme. We observed that outer membrane proteins were discriminated with an accuracy of 89% and their membrane spanning β-strand segments at an accuracy of 73% just from amino acid sequence information. The prediction server is available at

Crossref

PubMed Central

CUPSAT: prediction of protein stability upon point mutations

Author: Gromiha M. Michael
Parthiban Vijaya
Schomburg Dietmar
Publication venue: Oxford University Press
Publication date: 01/01/2006
Field of study

CUPSAT (Cologne University Protein Stability Analysis Tool) is a web tool to analyse and predict protein stability changes upon point mutations (single amino acid mutations). This program uses structural environment specific atom potentials and torsion angle potentials to predict ΔΔG, the difference in free energy of unfolding between wild-type and mutant proteins. It requires the protein structure in Protein Data Bank format and the location of the residue to be mutated. The output consists information about mutation site, its structural features (solvent accessibility, secondary structure and torsion angles), and comprehensive information about changes in protein stability for 19 possible substitutions of a specific amino acid mutation. Additionally, it also analyses the ability of the mutated amino acids to adapt the observed torsion angles. Results were tested on 1538 mutations from thermal denaturation and 1603 mutations from chemical denaturation experiments. Several validation tests (split-sample, jack-knife and k-fold) were carried out to ensure the reliability, accuracy and transferability of the prediction method that gives >80% prediction accuracy for most of these validation tests. Thus, the program serves as a valuable tool for the analysis of protein design and stability. The tool is accessible from the link

The role of stabilization centers in protein thermal stability

Author: Gromiha M. Michael
Magyar Csaba
Simon István
Sávoly Zoltán
Publication venue: 'Elsevier BV'
Publication date: 01/01/2016
Field of study

AbstractThe definition of stabilization centers was introduced almost two decades ago. They are centers of noncovalent long range interaction clusters, believed to have a role in maintaining the three-dimensional structure of proteins by preventing their decay due to their cooperative long range interactions. Here, this hypothesis is investigated from the viewpoint of thermal stability for the first time, using a large protein thermodynamics database. The positions of amino acids belonging to stabilization centers are correlated with available experimental thermodynamic data on protein thermal stability. Our analysis suggests that stabilization centers, especially solvent exposed ones, do contribute to the thermal stabilization of proteins

Elsevier - Publisher Connector

Repository of the Academy's Library

Identification of DNA-binding proteins using support vector machines and evolutionary profiles

Author: Gromiha Michael M
Kumar Manish
Raghava Gajendra PS
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Identification of DNA-binding proteins is one of the major challenges in the field of genome annotation, as these proteins play a crucial role in gene-regulation. In this paper, we developed various SVM modules for predicting DNA-binding domains and proteins. All models were trained and tested on multiple datasets of non-redundant proteins. Results SVM models have been developed on DNAaset, which consists of 1153 DNA-binding and equal number of non DNA-binding proteins, and achieved the maximum accuracy of 72.42% and 71.59% using amino acid and dipeptide compositions, respectively. The performance of SVM model improved from 72.42% to 74.22%, when evolutionary information in form of PSSM profiles was used as input instead of amino acid composition. In addition, SVM models have been developed on DNAset, which consists of 146 DNA-binding and 250 non-binding chains/domains, and achieved the maximum accuracy of 79.80% and 86.62% using amino acid composition and PSSM profiles. The SVM models developed in this study perform better than existing methods on a blind dataset. Conclusion A highly accurate method has been developed for predicting DNA-binding proteins using SVM and PSSM profiles. This is the first study in which evolutionary information in form of PSSM profiles has been used successfully for predicting DNA-binding proteins. A web-server DNAbinder has been developed for identifying DNA-binding proteins and domains from query amino acid sequences <url>http://www.imtech.res.in/raghava/dnabinder/</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

FOLD-RATE: prediction of protein folding rates from amino acid sequence

Author: Gromiha M. Michael
Selvaraj S.
Thangakani A. Mary
Publication venue: Oxford University Press
Publication date: 01/01/2006
Field of study

We have developed a web server, FOLD-RATE, for predicting the folding rates of proteins from their amino acid sequences. The relationship between amino acid properties and protein folding rates has been systematically analyzed and a statistical method based on linear regression technique has been proposed for predicting the folding rate of proteins. We found that the classification of proteins into different structural classes shows an excellent correlation between amino acid properties and folding rates of two and three-state proteins. Consequently, different regression equations have been developed for proteins belonging to all-α, all-β and mixed class. We observed an excellent agreement between predicted and experimentally observed folding rates of proteins; the correlation coefficients are, 0.99, 0.97 and 0.90, respectively, for all-α, all-β and mixed class proteins. The prediction server is freely available at

CiteSeerX

PubMed Central

Computational modeling of protein mutant stability: analysis and optimization of statistical potentials and structural features reveal insights into prediction model development

Author: Abhinandan Madenhalli
Gromiha M Michael
Parthiban Vijaya
Schomburg Dietmar
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Understanding and predicting protein stability upon point mutations has wide-spread importance in molecular biology. Several prediction models have been developed in the past with various algorithms. Statistical potentials are one of the widely used algorithms for the prediction of changes in stability upon point mutations. Although the methods provide flexibility and the capability to develop an accurate and reliable prediction model, it can be achieved only by the right selection of the structural factors and optimization of their parameters for the statistical potentials. In this work, we have selected five atom classification systems and compared their efficiency for the development of amino acid atom potentials. Additionally, torsion angle potentials have been optimized to include the orientation of amino acids in such a way that altered backbone conformation in different secondary structural regions can be included for the prediction model. This study also elaborates the importance of classifying the mutations according to their solvent accessibility and secondary structure specificity. The prediction efficiency has been calculated individually for the mutations in different secondary structural regions and compared. Results Results show that, in addition to using an advanced atom description, stepwise regression and selection of atoms are necessary to avoid the redundancy in atom distribution and improve the reliability of the prediction model validation. Comparing to other atom classification models, Melo-Feytmans model shows better prediction efficiency by giving a high correlation of 0.85 between experimental and theoretical ΔΔG with 84.06% of the mutations correctly predicted out of 1538 mutations. The theoretical ΔΔG values for the mutations in partially buried <it>β</it>-strands generated by the structural training dataset from PISCES gave a correlation of 0.84 without performing the Gaussian apodization of the torsion angle distribution. After the Gaussian apodization, the correlation increased to 0.92 and prediction accuracy increased from 80% to 88.89% respectively. Conclusion These findings were useful for the optimization of the Melo-Feytmans atom classification system and implementing them to develop the statistical potentials. It was also significant that the prediction efficiency of mutations in the partially buried <it>β</it>-strands improves with the help of Gaussian apodization of the torsion angle distribution. All these comparisons and optimization techniques demonstrate their advantages as well as the restrictions for the development of the prediction model. These findings will be quite helpful not only for the protein stability prediction, but also for various structure solutions in future.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Srinivasan (1962-2021) in Bioinformatics and beyond

Author: Gromiha M Michael
Orengo Christine
Sowdhamini Ramanathan
Thornton Janet
Publication venue: 'Oxford University Press (OUP)'
Publication date: 03/02/2022
Field of study

UCL Discovery